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Abstract 

In this letter we propose the use of physics techniques for entropy 
determination on constrained parameter optimization problems. The 
main feature of such techniques, the construction of an unbiased walk 
on energy space, suggests their use on the quest for optimal solutions 
of an optimization problem. Moreover, the entropy, and its associ- 
ated density of states, give us information concerning the feasibility 
of solutions. 

Statistical physicists are constantly developing new computational and 
theoretical tools to unravel the complex behavior of systems composed by 
many simple, interacting units. In the last years, many works [|l| have 
showed us that one could "translate" a variety of problems, ranging from 
biology to economy, to a physicist's language. One such example is the work 
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of Rose et. al., who extended the concept of "density of states" to complex 
optimization problems 0, |^. 

This paper follows the same guidelines: we propose the use of methods 
which obtain the entropy of physical systems to study constrained parameter 
optimization problems. These include important tasks, such as production 
planning, decision problems, structural optimization |^ and design of semi- 
conductor quantum devices f^. From the operational point of view these 
problems are usually analytically untractable and numerically hard to solve, 
mainly due to the intricate shape of the space of feasible solutions. 

A general constrained optimization problem can be formulated as follows: 

Minimize (or Maximize) E{x) (1) 
Subject to Qii^) < 0, i=l,...,q 

hi{x) = 0, i=q+l,...,m 

where x = {xi,X2,X3, is an n-component vector, gi{x) are q inequality 

constraints and hi{x) are m — q equality constraints. The functions E{x), 
gi{x) and hi{x) can be either linear or nonlinear, continuous or discontinuous. 
Feasible solutions can be defined as the set y4({x}) of vectors which satisfy 
all the constraints, and optimal solutions as the subset B & A oi the feasible 
ones which minimize (maximize) the cost E. 

Although there are many well-established heuristic algorithms for such 
problems, like evolutionary algorithms ^ and Constrained Simulated 
Annealing0, ^, none of them give, in a simple way, any information about 
the structure of the problem. 

The "translation" of the abovementioned problem to a physicist's lan- 
guage proceeds as follows: the variables Xi of the optimization problem (|^) 
are identified with the simple interacting units (pointed out in the first para- 
graph) and the vector x with a physical state. E{x) is the energy (or cost) of 
the state x. The density of states is here identified with the ratio between the 
number of solutions with a given cost E and the total number of solutions. 
Mathematically it reads 

S(E) = ^il^f^ (2) 

where the summation runs over all feasible solutions. The entropy S{E), in 
units of ks, is defined as S{E) = In g{E). 
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As pointed by H. Rose [0, independently of the complexity of the space of 
solutions, the density of states is a direct measure which reflects how sparsely 
states with a given quality E are distributed. Thus, from the density of states 
(or, equivalently, the entropy S{E)) we can estimate whether solutions with a 
given cost E represent global optimal values or if further optimization could 
lead to better results with reasonable computational effort]^. Unfortunately, 
it is not a simple task to obtain the entropy or to estimate it from traditional 
simulations (Metropolis importance sampling, microcanonical simulations, 
etc..) IP, Over the past few years, many efficient algorithms have been 
developed aiming at an efficient calculation of this quantity. Those include 
multicanonical methods the "Broad Histogram Method" [l^, |l^ and. 



more recently, the "Multiple Range Random Walk" algorithm (MRRW) [|I^]. 
Since the latter seems to be the most simple to implement and easiest to 
generalize [0, , we adopted it as a calculational tool to study optimization 
problems on this letter, and before applying it to our problem, let us briefly 
review its most important points. 

The main idea of the MRRW algorithm is to obtain the density of states 
(DOS) recursively. For a good resolution on the DOS, one must visit the 
energy axis on a non-biased way, what can be done by performing a random 
walk in energy space with the probability of visiting a state x being propor- 
tional to the reciprocal of the density of states, l/g{E). Since g{E) is not 
known a priori, it is set to g{E) = 1 for all E at the beginning of the random 
walk, and then configurations are randomly chosen. If a given choice changes 
the energy of the system from Ei to E2, the new configuration is accepted 
according to 

p(£.^£,)=min(^.l). (3) 

Then, after the energy level E is visited, g{E) is updated on a multiplicative 
fashion, g{E) g{E) x /, and the histogram of visits H{E) on an additive 
way, H{E) H{E) + 1. When fiuctuations on H{E) are sufficiently small 
(the magnitude of / is related to the "flatness" of the histogram), the mul- 
tiplicative factor / is decreased to /' = \/7- This sequence is repeated until 
a predeflned value of f is achieved (for more details, we refer to the original 
paper ||T^ . 

In principle, this prescription for a "flat histogram" assures that all energy 
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levels are visited equally and, in practice, one expects that the extreme values 
of the accessible energies will be visited with the same probability as any other 
energy level. Since on optimization problems one is interested in extremal 
values of a cost function (or energy) which characterizes the best solution 
of a given problem, is natural to think that "flat histogram" algorithms are 
good candidates for solving optimization tasks, as we will show on the rest 
of the paper. 

A great advantage of these methods over traditional techniques is that 
there are no restrictions on how one should make changes to the state x, 
as long as the new state sampled is accepted according to the transitional 
probability @. So, this choice can always be made such that it handles 
efficiently the constraints of the problem []TB| . 



Let us take simple test parabola E{x) = x^ on the interval 

< X < 1, with X being a random variable. From the analysis of functions 
of random variables , one can obtain the probability distribution func- 
tion P{E)dE, from which the entropy S{E) can determined. In this simple 
case it reads S{E) = —lnE/2. To recover this entropy with the MRRW 
algorithm we simply need to initialize our "system" with an initial "state", 
for instance x = 0.5, then choose randomly another "state" x'. The latter is 
accepted according to the transitional probability (H), and then histograms 
are updated. After accomplishing all steps of the method, |]18|, we compare 
our numerical estimate with the analytical result (fig. 1) and find a root 
mean square deviation of order 10~^. 

Once we verified the correctness of the method, let us focus on harder 
problems: 



1) Find the maximum of 



subject to WLi^i > 0.75, E7=iXi < 7.5n and < Xi < 10. 



(4) 
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Figure 1: Comparison between numerical and analytical results for the en- 
tropy of the function E = of a random variable < a; < 1. 

2) Find the maximum of 

n 

E2ix) = [VEyHxi (5) 

i=l 

subject to Er=i = l-O- 

These are well-known test functions for optimization algorithms [p^ 



whereas the optimal solution for the first problem is not known exactly, 
while for the second one the maximum is at = 1. 

Following the observation that very often the global solution of many con- 
strained numerical optimization problems lies on the boundary of the feasible 
region 0, we fix the search to the "edge of feasibility". For instance, in the 
first example we fix the condition Jjxi = 0.75 and perform simultaneous 
changes to two variables Xi and xj at each change (that is, modify Xi by a 
random variable < g < 2 and divide Xj by the same q). Doing so, we 
obtain the entropy of the former problem, which is depicted in figure (2) for 
the case n = 20. 
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Problem 


n 


MRRW 


EA 


i 




CSA 


m 




El 
El 
El 
E2 


20 
50 
100 
20 


0.803587 
0.835131 
0.845388 
0.999881 


0.803553 
0.833194 

0.999866 


0.803619 
1.0 



Table 1: Comparison between numerical results obtained with multiple range 
random walks (MRRW), evolutionary algorithms (EA) and constrained sim- 
ulated annealing (CSA). 



On table / we compare our results with those obtained with evolutionary 
algorithms (EA) |^ and constrained simulated annealing (CSA) |^. We 
spent approximately 10^ MC steps to obtain the entropies for both problems 
with a nice resolution (which corresponds to 222 seconds for problem 1 with 
n = 20 and 77 seconds for problem 2, both running on a pentium II 400 
MHz), while the optimal solutions are obtained in approximately 10 times 
less MC steps. These results were obtained from only one simulation, and 
could, in principle, be improved by increasing the number of MC steps or 
by employing a steepest descent method about the minimum found . 
Our numerical estimates are always better than those obtained with genetic 
algorithms, and can get as good as constrained simulated annealing ones with 
this additional (local) search pO . 



As a final test, we apply the MRRW algorithm to the traveling salesman 
problem (TSP), an archetypal problem on computer science and one of the 
six basic NP-complete problems|^l|. It can be formulated as the quest for 
the shortest path which connects cities displaced on a plane map. We try 
to find optimal tours for two particular instances of the problem, known as 
ATT48 and kroAlOO, for which there are known solutions, mathematically 
proven to be optimal [^. In order to visit the space of states (the space 
of feasible solutions) efficiently, we use the Lin Kerninghan algorithm ^3 



to sample between valid tours. We were able to find optimal tours with 
approximately 10^ MC steps in both instances. On figure (3) we show the 
entropy of the kroAlOO instance with the optimal tour on the inset. From 
the entropy, one can directly assess how difficult it is to find tours with a 



specified "quality" , or perimeter P |2^ 
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Figure 2: Entropy as a function of the energy E — Ei when HILi — 
0.75, n = 20. 

The aim of this paper was to propose the use of a simple and efficient 
algorithm for entropy determination of physical systems on constrained pa- 
rameter optimization problems. By performing a biased walk on configura- 
tion space, one is able to obtain an unbiased random walk on energy space 
and then to reconstruct with good precision the density of states. This is 
the property which makes this "flat histogram" algorithm a good tool for 
optimization tasks, since the extreme energies of the system are visited at 
the same frequency as other energies during the visitation scheme of the al- 
gorithm. As a result, we were able to obtain, with reduced computational 
efforts, satisfactory results for classical optimization problems. As an addi- 
tional feature, we gain insight into the complexity of the problem by means 
of its associated entropy. 
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Figure 3: Entropy of the kroAlOO instance. With the MRRW algorithm we 
were able to find the tour with shortest perimeter P, which is exposed on 
the inset. 
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